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In a scenario where two parties share, act on and exchange some physical resource, the assumption 
that the parties’ actions are ordered according to a definite causal structure yields constraints on 
the possible correlations that can be established. We show that the set of correlations that are 
compatible with a definite causal order forms a polytope, whose facets define causal inequalities. 

We fully characterize this causal polytope in the simplest case of bipartite correlations with binary 
inputs and outputs. We find two families of nonequivalent causal inequalities; both can be violated 
in the recently introduced framework of process matrices, which extends the standard quantum 
formalism by relaxing the implicit assumption of a fixed causal structure. Our work paves the way 
to a more systematic investigation of causal inequalities in a theory-independent way, and of their 
violation within the framework of process matrices. 


I. INTRODUCTION 

In our common understanding of the world, we typic¬ 
ally perceive events as being embedded in some causal 
structure, where events happening earlier can influence 
events happening later but not vice versa. Correlations 
can be established in such a picture by physical systems 
that may be shared or exchanged by different parties, 
and which may be used to communicate or convey causal 
influences. 

It is well known, however, that this view is challenged 
by quantum correlations: Bell’s theorem [1] shows for 
instance that these conflict with Reichenbach’s common 
cause principle [2, 3], so that quantum mechanics forces 
us to generalize the notion of causal influence [4-9] . An¬ 
other implication of this picture is that if one assumes 
that the parties interact only once with the physical me¬ 
dium, then only one-way influences (i.e., one-way signal¬ 
ing) are possible, which restricts—independently of any 
assumptions on the physics of the involved systems—the 
possible correlations that can be observed. 

But is this view that events should comply with a 
definite causal structure, and causal influences can only 
be unidirectional, necessary in any physical theory? Or 
could one envisage theories where the causal relations 
between events are not necessarily well defined [10, 11]? 
To answer these questions, Oreshkov, Costa and Brukner 
developed the framework of process matrices as an exten¬ 
sion of quantum theory, where the assumption of a fixed 
causal structure is relaxed [12], Process matrices describe 
the physical resource that allows different parties to es¬ 
tablish correlations. Oreshkov et al. showed that certain 
so-called causally nonseparable process matrices indeed 
do not comply with a definite causal structure. 

The incompatibility of a certain causally nonsepar¬ 
able process matrix with a definite causal structure was 
proven in Ref. [12] by its ability to generate correlations 
that are incompatible with a definite causal order, as 


demonstrated by the violation of a so-called causal in¬ 
equality. This can be tested in a device-independent man¬ 
ner, by just looking at the statistics observed in an ex¬ 
periment. It was recently shown that causal nonseparab¬ 
ility could also be detected in a device-dependent manner 
by using so-called causal witnesses [13]. This approach 
is more powerful as it can detect all causally nonsepar¬ 
able process matrices, while not all causally nonseparable 
process matrices can violate a causal inequality [13, 14]. 
Furthermore, physical implementations of certain (mul¬ 
tipartite) causally nonseparable process matrices, and of 
corresponding causal witnesses that detect their causal 
nonseparability, have been proposed [13-15] and even 
realized experimentally [16], while it is still not known 
whether there actually exist any physically realizable pro¬ 
cess that violates a causal inequality. Nevertheless, the 
device-independent approach is still of interest as it re¬ 
laxes the requirement to trust the functioning and the 
operations implemented by one’s devices in an experi¬ 
ment. It is furthermore also theory-independent: causal 
inequalities can in principle be tested, and correlations 
with no definite causal order can be identified whatever 
the description of the physical resource is—whether we 
use the process matrix framework or any other theory to 
be discovered in the future. A related open question is 
whether the ability to violate causal inequalities can—in 
analogy with Bell nonlocality [17] be exploited as a re¬ 
source, just like causally non-separable process matrices 
provide advantages for information-theoretical [18] and 
computational [19] tasks. 

Our paper aims at providing a better understanding 
of the device-independent characterization of correlations 
that are compatible with a definite causal order or not. 
We show that bipartite correlations with a definite causal 
order form a convex polytope, whose facets correspond 
to causal inequalities (Section II). We characterize this 
causal polytope in the simplest scenario where the two 
parties observe correlations with binary inputs and out- 
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puts, which gives us two families of new causal inequal¬ 
ities. We then investigate their possible violation in the 
framework of process matrices, and find that these can 
indeed be violated (Section III). This provides an ex¬ 
ample of “noncausal” process matrix correlations in a 
simpler scenario than that considered in Ref. [12], where 
one party had two input bits, or in Refs. [20, 2 ], where 
more parties were involved. 

II. CORRELATIONS 
WITH DEFINITE CAUSAL ORDER 

A. “Causal correlations” 

We consider an experiment with two parties, Alice and 
Bob, each of them having control over some closed labor¬ 
atory. They both open their lab, let some physical system 
in, interact with it and send a physical system out, only 
once during each run of the experiment. Alice and Bob 
are given some classical inputs labeled by x and y, and 
return some classical outputs a and 6, respectively. We 
assume that all inputs and outputs have a finite number 
of possible values. The correlation that Alice and Bob 
establish in such an experiment is described by the joint 
conditional probability distribution p{a , b\x, y ). 

In a situation where at each run of the experiment 
Alice’s events precede Bob’s events (denoted A -< B), 
Alice could send her input and output to Bob, but not 
vice versa; hence, there cannot be any signaling from Bob 
to Alice, and their correlation, which we shall denote in 
this case p A<B , must therefore satisfy 

V x, y, y', a, p AAB {a\x, y) = p AAB {a\x, y '), (1) 

with p A<B {a\x, ?/')) = J2bP A ^ B k| x , U^)- Similarly, 
in a situation where Bob’s events precede Alice’s (B -< 
A ), their correlation p B<A must satisfy the no-signaling- 
to-Bob constraint 

V x, x 1 , y, b, p B ^ A (b\x,y) = p B ^ A {b\x',y) , (2) 

with p B<A (b\x('\ y) = J2 a P BAA ( a ’ b\x^'\ y)- Note that 
non-signaling correlations satisfy both Eqs. (1) and (2), 
and are compatible with both causal orders A -< B and 
B -< A. More generally, if the correlation is compatible 
with the causal order A -< B with probability q , and with 
B -< A with probability 1 — q, then the correlation will 
be of the form 

p(a,b\x,y) = qp AAB {a,b\x,y) + (l-q)p BAA (a, b\x, y). 

( 3 ) 

Following Refs. [13, 14, 22], we call the bipartite prob¬ 
ability distribution p(a,b\x,y) (or the correlation it de¬ 
scribes, equivalently) “causal” if it can be written as 
in Eq. (3), with q £ [0,1] and p A ^ B and p B ^ A valid 
(i.e., nonnegative and normalized) probability distribu¬ 
tions satisfying Eqs. (1) and (2), respectively. Causal 


correlations are those that can be obtained in a situ¬ 
ation where every run of the experiment is compatible 
with a definite causal order {A -< B or B -< A), which 
may however vary for each run, and is only determined 
probabilistically. Note that the decomposition (3) is in 
general not unique, as non-signaling contributions can be 
included in either p A<B or p B ^ A . 


B. Causal polytopes and causal inequalities 

Correlations that are compatible with the causal or¬ 
der A -< B satisfy nonnegativity (p A ^ B (a, b\x, y) > 0 
Vx,y,a,b) and normalization (X] a bP A ^ B ( a i b\ x , y) = 1 
Vx, y) constraints, together with the no-signaling-to-Alice 
constraint (1). As these constitute a finite number of 
linear constraints on a bounded probability space 1 , it 
follows that the set of correlations p A<B is a (convex) 
polytope [23]. Similarly, the set of correlations p B<A 
that are compatible with the causal order B -< A is also 
a polytope. Now, according to Eq. (3), the set of causal 
correlations is simply the convex hull of the sets of correl¬ 
ations p A<B and p B<A 1 and is therefore itself a polytope, 
which we call the causal polytope. 

By construction, the extremal points of the causal 
polytope are extremal points of either the polytope of 
p A ^ B correlations, or of the polytope of p B<A correla¬ 
tions (or of both polytopes); in Appendix A we show 
that these correspond to deterministic correlations com¬ 
patible with either causal order (or both, in the case of 
nonsignaling correlations). From this “V-representation” 
of the causal polytope in terms of its vertices, for a given 
number of inputs and outputs, one can in principle de¬ 
termine its equivalent “'H-representation” in terms of its 
facets [23] (although in practice, this is a hard prob¬ 
lem to solve when the number of inputs and outputs in¬ 
crease). Some of its facets are trivial, in the sense that 
they only correspond to the nonnegativity constraints 
p(a,b\x,y) > 0; its other, nontrivial facets define so- 
called causal inequalities [12]— inequalities that are sat¬ 
isfied by any causal correlation. 

The above characterization hints of course at a strong 
analogy with Bell inequalities, which may be obtained 
as facets of the “local polytope” [1, 17, 24] (or may not; 
in the same way that not all Bell inequalities are facets 
of the local polytope, not all causal inequalities are fa¬ 
cets of the causal polytope, as they can also correspond 
to some external hyperplanes 2 ). Causal inequalities are 


1 The probability space can for instance be understood geomet¬ 
rically as the set of points in a high enough dimensional space, 
whose coordinates are the values p(a,b\x,y). Clearly, the non¬ 
negativity and normalization constraints make it bounded. 

2 E.g., one can check that the original causal inequality of Ref. [ 2] 
is not a facet of the causal polytope for 1 input bit for Alice, 2 
input bits for Bob, and 1 output bit for each of them (the 320 ver¬ 
tices of the causal polytope that saturate that inequality, out of 
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written as linear combinations of the conditional probab¬ 
ilities p(a, b\x,y), constrained by some “causal bounds”. 
They can also be translated in the language of “causal 
games” by considering for instance the linear combina¬ 
tion to define the score, or possibly the probability of 
success (for some specific distribution of inputs), of some 
game. They can be tested experimentally in a device¬ 
independent way— i.e., by just considering the observed 
statistics, without making any assumptions on the func¬ 
tioning of the physical devices used in the experiment: a 
violation of a causal inequality guarantees that the ob¬ 
served correlation is incompatible with a definite causal 
order—or, in short, is noncausal. 


C. The simplest causal polytope 

To illustrate the previous discussion, we now turn 
to the characterization of the simplest nontrivial causal 
polytope. Note that causal inequalities can only be non¬ 
trivial if each party has nontrivial inputs and outputs— 
i.e., if they can take at least two different values. In¬ 
deed, if one party only has trivial inputs or outputs, then 
clearly either (1) or (2) holds, so that any correlation is 
compatible with a definite causal order. 

Hence, the simplest candidate for a nontrivial causal 
polytope is the case with a single bit of input and output 
for each of the two parties * * 3 (which we shall denote by 0 
or 1), reminiscent of the scenario considered by Clauser- 
Horne-Shimony-Holt (CHSH) in the case of nonlocal¬ 
ity [26]. We generated the list of its 112 deterministic 
vertices (see Appendix A), and enumerated its 48 facets 
using the software lrs [27]. 

16 of these facets are trivial, corresponding to the non¬ 
negativity constraints p(ab\xy) > 0. By relabeling the 
inputs and outputs, the 32 remaining, non-trivial facets 
can be grouped in two non-equivalent families of causal 
inequalities: 16 facets are relabelings of the inequality 

\ 22 5 a , y S b ,x p(a,b\x,y) < (4) 

x,y,a,b 

where is the Kronecker delta, while the last 16 facets 


5056 vertices, only span a 21-dimensional affine subspace, while 

a facet of this 24-dimensional polytope should have dimension 
23). Likewise, the causal inequalities (21-24) below are not fa¬ 
cets of the causal polytope for binary inputs and outputs (they 
are only facets of its projection onto the plane considered in Sub¬ 
section III C). 

3 Actually, one also has a nontrivial causal polytope in a scenario 

where one of Alice and/or Bob’s input yields a binary output, 
while the other always gives the same output (or has no output, 
equivalently). In such a case the only nontrivial causal inequal¬ 
ities are of the LGYNI type, Eq. (5) or (7) (note on the other 
hand that the corresponding local polytope is trivial [25]). For 
simplicity however, we choose to impose throughout the paper 
that all inputs should have the same number of outputs. 


are relabelings of the inequality 

Y _ r 3 

^ ^s(k®t),0 P( a i j) — ) (5) 

x,y,a,b 

where ® denotes addition modulo 2. 

The causal inequality (4) can be interpreted as a bound 
on the maximal probability of success for a bipartite 
“guess your neighbor’s input” (GYNI) game [28] with 
uniform input bits x,y (such that p(x,y) = j), where 
Alice and Bob’s task is to guess each other’s input, i.e., 
to output a = y and b = x. Implicitly assuming uni¬ 
form input bits 4 , inequality (4) can indeed be written in 
a more compact form as 

Pgyni := p(a = y,b = x) < - . (6) 

This causal bound on the probability of success pgyni 
can easily be understood: assuming that the correlation 
is compatible with the causal order A -< B, Alice cannot 
know anything about Bob’s input bit and can therefore 
only make a random guess, so that p(a = y) = ^ and 
therefore p(a = y,b = x) < a similar reasoning holds 
for the causal order B -< A, and a convex mixture cannot 
increase the bound on pgyni- 

Similarly, the causal inequality (5) can be interpreted 
as a bound on the maximal probability of success for 
what we shall call a “lazy GYNI” (LGYNI) game, still 
with uniformly random input bits, where Alice and Bob’s 
task is now to guess each other’s input only when their 
respective input is 1 (for an input 0, their output can 
be arbitrary). Implicitly assuming uniform input bits, 
inequality (5) can then also be written in a more compact 
form as 

3 

Plgyni := p(x{a ffi t/) = 0, y{b © x) = 0) < -. (7) 

This causal bound on the probability of success plgyni 
can also easily be understood with a similar reasoning 
as above (taking into account that Alice for instance is 
only asked to guess Bob’s input half of the time, when 
her input is 1). 

III. PROCESS MATRIX CORRELATIONS 
WITH NO DEFINITE CAUSAL ORDER 

In this section we study the violation of our 
simplest causal inequalities in the framework of process 
matrices, introduced recently by Oreshkov, Costa and 
Brukner [12]. Let us first start with a brief overview of 
this framework. 


4 Note that the assumption of uniform inputs is only necessary to 
justify the shorthand notation p(a = y,b = x) for the left hand 
side of Eq. (4), and to interpret it as the success probability for 
the GYNI game. Whether an inequality written as a combination 
of conditional probabilities (like (4) or (5) for instance) defines 
a causal inequality or not depends of course in no way on the 
distribution of inputs. 
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A. The process matrix framework 

The basic assumption of the framework is that 
quantum theory correctly describes what happens locally 
in Alice and Bob’s laboratories; however, no assumption 
is being made about the global causal structure in which 
the parties operate. 

More specifically, it is assumed that Alice and Bob can 
perform any operation allowed by the standard formula¬ 
tion of quantum theory, as described by quantum in¬ 
struments [29] from some input Hilbert spaces T~L Ai and 
'H Bi (for Alice and Bob, respectively) to some output 
Hilbert spaces 'h A ° and T-L Bo . An instrument is a set 
of completely positive (CP), trace non-increasing maps 
from C(H Xl ) to C('H X °) (for X = A, B ), where C(H Xl ) 
and C('H X °) are the spaces of linear operators over the 
Hilbert spaces 'H Xl and 'H x °. Each CP map of a given 
instrument is associated with a given classical output, 
which we shall again denote by a and b for Alice and 
Bob, and all CP maps of an instrument must sum up 
to a trace-preserving map. The various instruments that 
the parties can choose to apply shall be labeled by some 
classical “inputs” x and y. 

Using the Choi-Jamiolkowski (CJ) isomorphism [30, 
31], one can represent Alice’s maps as some operators 5 
Ma\ x A ° on the tensor product space C(H Ai ®'H A °). The 
conditions for the collection of operators {M A ^ x A °} a (for 
some fixed input x) to be a valid instrument translate to 

M& Ao > 0 Va and tr Ao ^M^°=l^, (8) 

a 

where tr Ao denotes the partial trace over Alice’s output 
system, and \ Al is the identity operator in Alice’s input 
Hilbert space. Similarly, Bob’s maps can be represented 
as operators M B y B ° on 0 T-L B °), and a collection 

of operators {M B ^ B °}b (for some fixed input y) must 
satisfy analogous constraints to Eq. (8) to be a valid in¬ 
strument. 

As shown in Ref. [12], the assumption of local consist¬ 
ency with quantum theory implies that the probability 
p(a,b\x,y) of observing the classical outputs a,b for a 
choice of instruments labeled by x,y is a bilinear func¬ 
tion of Alice and Bob’s maps, which can be written as 

p(a, b\x, y) = tr [(M^° 0 M B ^ B °) ■ W] , (9) 

for some hermitian matrix W G C(H Ai 0 H A ° 0 U Bl 0 
H b °). Requiring that the probabilities given by (9) 
are nonnegative and normalized for all possible choice 
of quantum operations (including operations involving 


5 Throughout the paper, superscripts on operators refer to the 
Hilbert space they act on. 


possibly entangled ancillary systems) imposes some re¬ 
strictions on the possible W matrices [12]. As shown in 
Ref. [13], these constraints can be expressed as follows: 


W > 0 , (10a) 

tr W = d Ao d Bo , (10b) 

BrBoW = AoBtBoW , (10c) 

A lAo W = Ai A 0 B 0 W , (10d) 

w = Bo W + Ao W - AoBo W, (lOe) 


where the last three conditions are written using the op¬ 
eration x ■ defined by 

t x 

xW = — ®ti x W ( 11 ) 

dx 

for X = Ai,Ao,Bi,Bo, with 1 A and tiyy denoting the 
identity operator and the partial trace over the Hilbert 
space H x , respectively, and dx denoting its dimension. 

Operators W that satisfy these conditions are called 
process matrices. They represent the most general way 
to “connect” the output spaces 'h A ° 0'H S ° to the input 
spaces 'H Al 0 V. Bl (see Fig. 1) in a way that is locally 
consistent with quantum theory. While these conditions 
do not impose a global causal order a priori and there¬ 
fore allow in general for two-way signaling, the nonneg¬ 
ativity and normalization conditions on the probabilities 
guarantee that no logical paradoxes, like the grandfather 
paradox for instance [32, 33], appear. In the following 
we will refer to correlations of the form (9), with Alice 
and Bob’s instruments satisfying Eq. (8) (together with 
its analogous form for Bob) and W satisfying the condi¬ 
tions (10), as process matrix correlations. 



Figure 1. A process matrix W represents the physical resource 
which connects Alice’s ( ’H A °) and Bob’s ( H B °) output Hil¬ 
bert spaces to their input Hilbert spaces (H Al , 'H Bi ) in such a 
way that what happens in Alice and Bob’s labs is locally con¬ 
sistent with quantum theory [12]. Process matrices generalize 
in particular quantum states and quantum channels. 
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B. Violation of the simplest causal inequalities 
by process matrix correlations 


It was shown in Ref. [ 2] that certain process matrices 6 
could generate correlations with no definite causal order. 
A specific process matrix and specific instruments were 
indeed found, which violate a particular causal inequal¬ 
ity with one input bit for Alice and two for Bob, and 
one output bit for each. Remarkably, one of Bob’s input 
bits could be used to distinguish some runs of the ex¬ 
periment where signaling happened in one direction, and 
some runs where it happened in the other direction. It re¬ 
mained an open question, whether this special input bit 
for Bob was necessary to obtain noncausal correlations 
in the process matrix framework, or whether any simpler 
causal inequality (with fewer inputs) could be violated. 
Here we answer this question positively, by exhibiting vi¬ 
olations of both our GYNI and LGYNI inequalities (6, 
7) by process matrix correlations. 

Let us start with a simple example with two- 
dimensional input and output systems—“qubits”—for 
Alice and Bob (i.e., cLa, = d,A 0 = dsj = c?b 0 = 2). 
One can check that the matrix 




Z Al Z A ° Z Bl t B ° + Z Al t A °X Bl X B °" 


V2 


( 12 ) 


where Z and X are the Pauli matrices and where tensor 
products are implicit, satisfies the constraints (10), so 
that it defines a valid process matrix. We choose Alice 
and Bob’s operations to be the same, defined by 


|0)(0|. With these definitions, one can calculate the suc¬ 
cess probabilities of the GYNI and LGYNI games using 
Eqs. (6, 7)— or more explicitly (4, 5)— and Eq. (9). One 
finds 


5 

( 1 \ 

1 



PGYNI = Ygl 


« 0.5335 > - , 

2 


(17) 

5 

( 1 \ 

1 

3 


PLGYNI = — 1 

+ 

+ - « 0.7835 > 
4 

4 ’ 

(18) 


which indeed violate the causal inequalities (6, 7). 

One may now wonder, what the largest possible viol¬ 
ation of these two causal inequalities by process matrix 
correlations is. To optimize the violations for some in¬ 
put and output Hilbert spaces of a given dimension, we 
used a See-Saw algorithm inspired by that of Werner and 
Wolf [34], as described in Appendix B. Note that because 
the optimization problem is nonconvex, the algorithm is 
not guaranteed to converge to a global maximum. Never¬ 
theless, for small enough dimensions (at least, for qubits), 
the repeatability of our results for different random start¬ 
ing points of the algorithm makes us confident that we 
indeed found the global maxima. From our numerical 
results, we thus conjecture that the maximal violations 
of our causal inequalities achievable with qubit systems 
are 


Pgyot -2 « 0-5694 > i , (19) 

1 O 

max,ti=2 n omi max,d=2 . _ ° /on\ 

PlgVni « 0- 8194 = Pgyni + 4 > 4 • ( 2 °) 


K\o A ° = M o\o B ° ( 13 ) 

M A ^ Ao = M B q B ° = 2 |4> + )(4> + | , (14) 

M a [ a ° = M b [ b ° = 10X01 0 10X01, (15) 

M A [ Ao = M b [ b ° = |1)(1| ® 10X01, (16) 

with {|0), 11)} denoting the computational basis (i.e., the 
eigenbasis of Z), and |<i> + ) := (|00) + |ll))/\/2. These in¬ 
deed satisfy (8), and thus constitute valid instruments. 
These operations can be interpreted as follows: when 
their input is 0, Alice and Bob simply transmit their 
incoming physical system, untouched (2|$ + )(4> + | being 
indeed the CJ representation of an identity channel), and 
output the value 1; when their input is 1, Alice and 
Bob perform a measurement in the Z basis, whose result 
defines their classical output, and send out the fixed state 


In Appendix C we give an analytical description of the 
process matrices that reach these values. 

Going to larger dimensions, we found that the maximal 
value of pgyni could increase, as summarized in Table I 
for dimensions up to 5; however, we did not find any 
larger value for plgyni than PlcjynT 2 a 6° ve ; which— 
provided our See-Saw algorithm did find the global 
maxima—reveals some fundamental difference between 
the two inequalities, despite their similarities (and in ad¬ 
dition to the fact that contrary to GYNI, the outputs cor¬ 
responding to certain inputs are irrelevant in the LGYNI 
game; see also footnote 3). It remains an open ques¬ 
tion, which values are the true “Tsirelson bounds” [35] 
for these two causal inequalities, in the sense of the 
largest possible values for pgyni and plgyni reachable 
with quantum process matrices of any dimension. 


A necessary condition for a process matrix to allow for a causal 
inequality violation is that it is causally nonseparable [12]— i.e., 
that it is itself incompatible with a definite causal order. In the 
multipartite case this is known however not to be a sufficient 
condition [13, 14]. It remains an open question whether there 
can be bipartite causally nonseparable process matrices that only 
generate causal correlations. 


d 

2 

3 

4 

5 

max,d 

Pgyni 

0.5694 

0.6104 

0.6201 

0.6218 


Table I. Maximal values of pgyni found through numerical op¬ 
timization, as a function of the dimension of Alice and Bob’s 
input and output Hilbert spaces d = dA T = dA 0 = ds T = 
di3o • 








C. Boundary of the set 
of process matrix correlations 
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To finish with, let us picture the set of process matrix 
correlations vs that of causal correlations. In order to 
visualize the two, we shall project them onto the plane 
with coordinates (p(a = y),p(b = x)), where we again 
implicitly assume uniformly random inputs for ease of 
notations 7 . In this plane the complementarity between 
the two directions of signaling, from Alice to Bob and 
from Bob to Alice, is conspicuous; the projected causal 
polytope is bounded here by the four causal inequalities 8 
(see Figure 2) 


\p{a = y) + \p(b = x) < (21) 

\p{a = y) - ip(6 = x) < ^ , (22) 

-\p{a = y)+ \p(b = x) < i, (23) 

-\p( a = y)-\pi b = x ) < (24) 


To obtain a lower bound for the boundary of the set of 
process matrix correlations, we again used the See-Saw 
algorithm described in Appendix B to maximize quant¬ 
ities of the form ap(a = y) + /3p(b = x), with various 
weights a, fj. Different runs of the algorithm gave us dif¬ 
ferent lower bounds (recall that the See-Saw algorithm is 
not guaranteed to always find the global optimum), which 
we combined to obtain the bounds represented on Fig¬ 
ure 2 for dimensions d = 2, 3 and 4 of the input and out¬ 
put Hilbert spaces—and which we believe are close to the 
actual boundaries of the process matrix correlations for 
these dimensions. The largest violations of the causal in¬ 
equality (21) we found are \ p(a = y) + \ p(b=x) = 0.7715 
for d = 2; 0.7892 for d = 3; and 0.8001 for d = 4. 

A surprising feature of the set of process matrix cor¬ 
relations for dimension 2 is that it does not seem to be 
convex (see Figure 2, red region). We believe this is a true 
characteristic of it, not only a numerical artifact due to 



Figure 2. Projection of the probability space for binary in¬ 
puts and outputs onto the plane ( p(a=y),p(b=x)j (see main 
text). The causal polytope is projected onto the blue dia¬ 
mond, delimited by the causal inequalities (21-24). The red, 
green and purple regions correspond to process matrix correl¬ 
ations that are reachable with input and output Hilbert spaces 
of dimensions d = 2, 3 and 4, respectively. The outer dashed 
square delimits all valid probabilities p(a = y),p(b = x) £ [0,1]. 
Its upper right corner corresponds for instance to a correla¬ 
tion such that Alice’s output is always equal to Bob’s input 
(a = y) and Bob’s output is always equal to Alice’s input 
(b = x), which requires perfect 2-way signaling and violates 
Eqs. (6), (7) and (21) up to their algebraic maximum. This 
correlation may somehow be thought of as being analogous to 
the Popescu-Rohrlich (PR) box considered in the context of 
nonlocal correlations [37] (one difference, however, is that this 
correlation is deterministic, while the PR box correlations are 
not). 


some failure to find global optima. Note, nevertheless, 
that the boundary of the set of process matrix correl¬ 
ations for arbitrary dimensions is convex, as proven in 
Appendix D. 


IV. CONCLUSION 


7 Without assuming uniformly random inputs, p(a = y) and 
p(b = x) in Eqs. (21-24) and in Figure 2 should be replaced 

W i J2 x ,y,a,b Sa ’y P ( a ’ b \ X ’ y ') and \ J2x,y,a,b Sb ’ xP( - a,b \ X,y ^ r6 " 
spectively. 

8 Note that inequality (21) looks quite similar to Oreshkov et al.’s 
original causal inequality [12] (which motivated us in particu¬ 
lar to write it with the i factors). Oreshkov et al.’s inequality 
includes some additional conditioning on a second input bit for 
Bob. Averaging that inequality with its equivalent version where 
Bob’s second input bit is flipped yields inequality (21). Similarly, 
Brukner’s “Tsirelson-like bound” for Oreshkov et al.’s inequal¬ 
ity [36] (for a limited set of possible instruments) also yields the 

same Tsirelson-like bound ~ 0.8536 for inequality (21) 

(with the same restriction); there remains a large gap between 
that bound and the lower bounds we obtained numerically for 
dimensions 2, 3 and 4. 


We have shown that the set of correlations compat¬ 
ible with a definite causal order (“causal correlations”) 
forms a convex polytope, which we fully characterized 
in the simplest nontrivial bipartite scenario with bin¬ 
ary inputs and outputs. Two nonequivalent families of 
causal inequalities were obtained, Eqs. (6-7), for which 
we gave intuitive interpretations in terms of “causal 
games”. These allow for a device-independent charac¬ 
terization of correlations with or without definite causal 
order, and can be tested independently of the physical 
theory under consideration. We exhibited in particular 
violations of these inequalities by process matrix cor¬ 
relations, which generalize standard quantum correla¬ 
tions. Because of their simplicity (and despite the vi- 
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olations we found being somewhat less intuitive), we ex¬ 
pect these new inequalities— in particular the first one, 
interpreted as a “guess your neighbor’s input” game— 
to become archetypical examples of causal inequalities, 
just like the CHSH inequality is the archetype of Bell 
inequalities [17, 26]. 

Our approach can be used to characterize (non)causal 
correlations in more complex scenarios as well. It should 
be noted that because of the large dimension of the prob¬ 
ability space and the large number of vertices of the 
causal polytope (see Appendix A), the full facet enu¬ 
meration problem rapidly becomes intractable in prac¬ 
tice as the number of inputs and outputs increases bey¬ 
ond the simplest binary case. Nevertheless, one could 
adapt some of the tricks developed for the derivation 
of Bell inequalities (see Ref. [17] for a review) to con¬ 
struct new causal inequalities for various scenarios of in¬ 
terest. Violations of these inequalities in the process mat¬ 
rix framework can then be investigated using our See-Saw 
algorithm. An interesting question is whether it would 
also be possible to derive nontrivial bounds on such viol¬ 
ations from certain information-theoretic principles [38], 
along analogous lines to the research program that aims 
at restricting quantum nonlocal correlations from various 
principles [17]. 

In the present paper we focused for simplicity on the 
bipartite case. Our work can naturally also be extended 
to the case of more parties. With a proper generaliz¬ 
ation of the concept of noncausal correlations (see for 
instance Ref. [14]), it can also be shown that multipart¬ 
ite noncausal correlations form a convex polytope. Sim¬ 
ilar techniques can be used to characterize this polytope, 
construct causal inequalities and investigate their pos¬ 
sible violation. Note that a remarkable new feature in 
the multipartite case is that violations are also possible 
with “classical process matrices” [21]. 

One of the main open questions along the line of re¬ 
search presented here is whether it would actually be 
possible, in practice, to observe correlations with no def¬ 
inite causal order and a violation of a causal inequality. 
As an extension of standard quantum theory, the frame¬ 
work of process matrices—which does indeed predict such 
violations theoretically—appears as a good candidate to 
provide such a possibility. However, to the best of our 
knowledge, no practical implementation has been iden¬ 
tified for any of the process matrices that are known to 
violate a causal inequality [12, 20, 21] (including the ones 
presented here)—while in contrast, a causally nonsepar- 
able quantum process has been recently demonstrated 
experimentally [16]. It is likely that more complex scen¬ 
arios need to be considered, and a systematic investig¬ 
ation of causal inequalities and their violation by pro¬ 
cess matrix correlations might prove useful to find prac¬ 
tical violations—or, should it be the case, to clarify why 
such violations cannot be observed in practice. Our work 
makes the first crucial step in this direction. 

Note added. — While finishing writing up this manu¬ 
script, we became aware that the concept of causal poly¬ 


topes introduced here was also referred to (with proper 
reference to our work) in Ref. [14], where the emphasis 
was put on multipartite scenarios, and in Ref. [39], where 
the authors also introduced, for the multipartite case as 
well, larger polytopes of logically consistent but possibly 
noncausal classical processes. 
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Appendix A: Characterization of the causal polytope 

1. Vertices of the causal polytope 

As explained in the main text, the causal polytope, 
defined as the set of causal correlations of the form (3), is 
the convex hull of the polytope of correlations compatible 
with the causal order A -< B (which cannot signal to 
Alice) and the polytope of correlations compatible with 
the order B -< A (which cannot signal to Bob), so that its 
vertices are vertices of at least one of these two polytopes. 

Let us characterize the vertices of the polytope of cor¬ 
relations p AAB . Using Bayes’ rule, we can write 

p AAB {a,b\x,y) =p{a\x,y)p(b\x,y 7 a) = p{a\x) p(b\x 7 y,a), 

where the last equality follows from the no-signaling-to- 
Alice constraint (1). No further constraint is imposed 
on p(a\x) and p(b |x, y , a) (except that they must be valid 
probability distributions); these can be written as convex 
combinations of deterministic distributions, in the form 

p(a\x) = ^2 dad a , a { x ), 

Ot 

p(b\x, y, a) = ^2 'Z/3<5b,/3(z,y,a) , 
p 

with q a ,qp > 0, da = 9/3 = 1, where the a’s 

denote deterministic functions of Alice’s input x, and 
the /3’s denote deterministic functions of both Alice and 
Bob’s inputs x, y and of Alice’s output a. Combining 


them, we get 


3. For binary inputs and outputs 


p (fl, b\x, y') ^ ' Qa Q0 3a,a(x) ^b,0(x,y,a) 

a,(3 

^ v Q.c^Q/3 ^a,ci(x ) 3b,(3 a (x,y) 
a,(3 

^ v ^og/3' ^a, a(x) ^b, (3'{x,y) i 
oi,/3' 

where p a (x,y) = p(x,y,a(x)) and q ai/3 / = q a qy\ a 
with qp'\ a = J 2 /J Spc., 0 ' 9/3; such that q a ,p' > 0 and 
Ea 0 ' = 1- Hence, any correlation p A<B can be 

written as a convex combination of deterministic correl¬ 
ations compatible with the order A -< B —which thus 
correspond to the vertices of the corresponding polytope 
of correlations p A<B . 

If Alice and Bob’s inputs can take rriA and ms different 
values, respectively, and their outputs can take kA and 
ks values, resp., then there are k™ A different determ¬ 
inistic functions a(x) and kg ArnB functions P'(x,y), so 
that the polytope of correlations p A<B has k™ A x 
vertices. Note that kg B of the functions /3'(x,y) do not 
depend on x\ hence, out of all the vertices, k™ A kg B are 
non-signaling, while the other k™ A (k r ^ ArnB — k^ B ) are 
signaling to Bob. 

Similarly, the vertices of the polytope of correlations 
p B ^ A are k™ A m B jaub deterministic correlations com¬ 
patible with the order B -< A, of which k^k ^ 3 are non¬ 
signaling and are thus common to the previous polytope 
of correlations p A<B , while the other k^ B (k™ AmB —k™ A ) 
are signaling to Alice. The vertices of the causal polytope 
are all the deterministic correlations 9 compatible with 
either the order A -< B, or the order B -< A, or both— 
which makes a total of k™ A k 7 £ AmB + k™ AmB k^ B — 
k™ A k^ B vertices. 

2. Dimensions 

Because of the mArris normalization constraints 
Ea bP( a ’b\ x , y) = 1, the probability space of correl¬ 
ations p(a,b\x,y ) is of dimension mAmsikAkB —!)■ 
With the no-signaling-to-Alice and no-signaling- 
to-Bob constraints (1, 2), the dimensions of the 
polytopes of correlations p A<B and p B ^ A are re¬ 
duced to mAmB(kAkB-l) — ?7m(to,b —1)(^4~ 1) and 
mAmsikAkB —1) — {rriA— 1)uib(/cb — 1), respectively. 
However, the causal polytope— i.e., their convex hull- 
remains of the same dimension as the full probability 
space. 


9 Note that the fact that the causal polytope has deterministic 
vertices was not trivial a priori ; this contrasts for instance with 
the no-signaling polytope, which has non-deterministic vertices 
like the PR-box [37]. 


In the case where both Alice and Bob’s inputs and 
outputs take binary values, the 10-dimensional polytopes 
of correlations p A<B and p B<A both have 64 vertices, 
among which 16 are non-signaling vertices common to 
both polytopes. The 12-dimensional causal polytope thus 
has 64 + 64 — 16 = 112 different vertices. 

We enumerated the facets of the causal polytope for 
binary inputs and outputs by solving the convex hull 
problem using the software lrs [27]. As described in the 
main text, we obtained 48 facets, which can be grouped 
into 3 families of equivalent facets (up to relabelings of 
inputs and outputs). Explicitly, these are 

• 16 trivial facets of the form p(a,b\x,y) > 0 for all 
x,y,a,b = 0,1; 

• 16 facets of the GYNI type, which can be written 
(in the same form as (6), implicitly assuming uni¬ 
form input bits) as 

p{a® a\x ® a 0 = y,b® Pry ® Po = x) < 
for all a 0l ai,P 0 ,Pi = 0,1; 

• 16 facets of the LGYNI type, which can be writ¬ 
ten (in the same form as (7), implicitly assuming 
uniform input bits) as 

3 

p((x®ai)(a©a 0 ® 2 /) = 0, (y® Pi)(b® Po®x) = 0) < -, 
for all a 0 ,ai,P 0 ,Pi = 0,1. 

Note that this causal polytope for binary inputs and 
outputs coincides with the polytope of correlations ob¬ 
tained from a local model augmented with one bit of 
communication, as described in Ref. [40]. This is be¬ 
cause the use of just one bit of (one-way) communication 
is of course compatible with a definite causal order, either 
A -< B or B -< A, and for binary inputs one bit is enough 
for one party to send all the information about her input 
to the other party. In general however, the polytopes 
described in Ref. [40] are different from causal polytopes. 

Appendix B: See-Saw algorithm 

Maximizing the violation of a causal inequality over 
the process matrix and the instruments is a nonlinear 
problem, which makes it intractable directly. To address 
this problem, we used an approach inspired by the See- 
Saw algorithm of Werner and Wolf [34] . The idea is that 
if Alice and Bob’s instruments are fixed, then the com¬ 
bination of probabilities that enters the causal inequality 
is a linear function of the W matrix, and maximizing it is 
a semidefinite programming (SDP) problem [41] that can 
be solved efficiently. In the same spirit, if the W matrix 
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and the instruments of one party are fixed, then the value 
of interest is a linear function of the instruments of the 
other party, and again its optimization is a SDP problem. 
Hence, one can try to approach the maximum violation of 
a causal inequality by optimizing over the process matrix 
and the parties’ instruments in an iterative manner. 

More specifically, let u(W, A , B) be the value taken by 
the combination of probabilities in the causal inequality, 
considered as a function of the process matrix W and the 
sets of instruments A = {M A £ A °}a and B = {M B y B °}b 
(in their CJ representation). We start the algorithm by 
generating random sets of instruments Ao and Bo, and 
for these fixed instruments we maximize w considered as 
a function of W, via the following SDP problem: 

maximize cj(W,Aq, Bo) 
subject to 

W > 0, tr W = d Ao d B o , 

BrB 0 W = AoBjBoW , Al A 0 W = Ai A 0 B 0 W > 

W = Bo W + Ao W ~ AoBo W . 

With the optimal process matrix Wq thus obtained and 
the fixed set of instruments Bo for Bob, we proceed to 
optimize u as a function of Alice’s instruments, via the 
SDP problem 


V z, a , 


maximize u>(W q, A, Bq) 
subject to 

m AiAo > o , tr Ao V M A , lAo = t Al . 

a\x — 7 / _j a\x 

a 


With the optimal set of instruments Aq obtained now and 
the previously obtained process matrix Wo, we do the 
analogous optimization over Bob’s set of instruments B, 
and iterate the three optimization steps of the algorithm 
until it converges. 

One can see that at each step of the algorithm the value 
of u! can only increase, so it is guaranteed to converge to 
a local maximum. One does not, however, always get 
the global maximum, and in practice one must repeat 
the algorithm several times to get a good lower bound 
on the maximal value of u. 

Note that this See-Saw algorithm can of course 
straightforwardly be adapted to more than two parties. 


Appendix C: Maximal violations for qubits 

The best violations of our GYNI and LGYNI causal in¬ 
equalities that we found, for local dimensions d = d Al = 
d Ao = d Bl = d Bo = 2 (i.e., for “qubits”), is reached by 
any convex combination 


Q fYmax,d=2 + (1 <l) W / m ax,d=2 


(with 0 < q < 1 ) of the two process matrices W m ax,d =2 
and w L*,d=2 defined as 


W' 


(') 


:,d=2 


i l 04 + <4° ztzt - 4° (zttt+ttzt) 

-d:p (ZttZ+tZZt) + 4° ( ZtZZ+ZZZt) 
+4 ,} (Z1XX-Z1YY+XXZ1-YYZ1) 


(with implicit tensor products and implicit superscripts), 
where the coefficients 00 , 01 , 02 , 03 , and 04 are, respect¬ 
ively, real roots of the polynomials 


4 608 x 4 - 1575 z 3 + 525 z 2 - 117 z - 1, 

221184 z 4 + 142 479 z 3 - 19 701 z 2 - 15 603 z + 2 363, 
9 216 z 4 - 16 857 z 3 + 11 724 z 2 - 3 660 z + 430, 
221 184 z 4 - 50 895 z 3 - 16 200 z 2 + 1368 z + 602, 
221184 z 4 + 16 335 z 3 - 37 008 z 2 - 11400 z + 3 440, 


and the primed coefficients 03 , 0 ^, 02 , 03 , and a' 4 are, re¬ 
spectively, real roots of the polynomials 

4 608 z 4 + 8 595 z 3 + 5 583 z 2 + 873 z - 43, 

221 184 z 4 - 101601 z 3 - 1 701 z 2 + 2 745 z + 305, 

3 072 z 4 - 2 229 z 3 + 540 z 2 - 60 z + 4, 

221 184 z 4 - 294 975 z 3 + 145 080 z 2 - 31 224 z + 2 492, 
221184 z 4 + 16 335 z 3 - 37 008 z 2 - 11400 z + 3 440. 


Numerically, their values are 


o 0 = 0.2744, 
or = 0.2178, 
o 2 = 0.3628, 
o 3 = 0.3114, 


a' 0 = 0.0390, 
a[ = 0.3355, 
a' 2 = 0.2451, 
a' 3 = 0.4291, 


04 = a' 4 = 0.2097. 


Using the same instruments for Alice and Bob as 
those defined in Eqs. (13-16), our maximal probability 
Pgyni °f w i nn i n g the GYNI game with qubits is then 
found to be the smallest real root of the polynomial 


1 769 472 z 4 —2 884 032 z 3 +l 630 800 z 2 -380 052 z+34 087, 


and our maximal probability Plgyni 2 °f winning the 

LGYNI game with qubits is PlgynT 2 = Pgyni^ 2 + l/ d - 
Numerically, we obtain 


max,d=2 

P GYNI 


0.5694 > 


max,d=2 

Plgyni 


0.8194 > 


3 

4 ' 


It is somewhat surprising that these maximal viola¬ 
tions of the GYNI and LGYNI inequalities with qubits 
have such complicated expressions—contrary for instance 
to the case of Oreshkov et al.’s original causal inequal¬ 
ity (for which the violation exhibited in Ref. [12] was 
proven, under certain constraints, to be optimal [36]), 
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or to the case of the well known CHSH Bell inequal¬ 
ity [26, 35]. Note also that, as mentioned in the main 
text, we could find higher violations of the GYNI in¬ 
equality using higher-dimensional quantum systems (see 
Table I), while we couldn’t find any higher violations of 
the LGYNI inequality. 


Appendix D: 

The set of process matrix correlations is convex 


M t AlAo = | 0 )( 0 |^ ® M^ a fo + | 1 )< 1 | A ' ® M^ a f x ° , 


M, 


a\x 

B'jBjBo 

b\y 


1; a |a 


= |0)(0| S/ ® M^ B y ° + |1)(1| B ' <g> Ml 


b\y 


One can easily check that W thus defined is a valid pro¬ 
cess matrix, and that M A ^ AiA ° and define 

valid instruments. Furthermore, a straightforward cal¬ 
culation shows that for these process matrix and instru¬ 
ments, 


In this Appendix we show that the set of process mat¬ 
rix correlations is convex. 

Let po(a, b\x, y) and p\{a,b\x,y) be two correlations 
realized by the (valid) process matrices and instruments 
Wand re¬ 

spectively, so that 


p(a, b\x, y) = tr [(m£ A iA ° <g> M* BiB °) ■ W] 



= QPo(a,b\x,y) + (l-q)p 1 (a,b\x,y). 


Po(a, b\x, y ) = tr [{M AlA ° ® M B ' B °) • W 0 ] , 
Pi (a, b\x, y) = tr [{M^ A ° ® M^°) • Wi] . 


Without loss of generality we assume that 
Alice and Bob’s input and output systems in 
{Wo,Ml' A °,M 0 %°} and in { W 1 , } 

have the same dimensions (one can indeed always embed 
lower-dimensional systems into larger-dimensional ones). 
Let us now introduce some ancillary 2-dimensional input 
systems with Hilbert spaces TL Al and TL b i and define, 
for any q G [0,1], 


W = q |0X0| A ^ ® |0)(0| B ' ® Wo 

+ (1-5) |i)(i| Ai ® |ixi| BJ ® m , 


Thus, any convex combination qpo + (l—q)pi of process 
matrix correlations can also be realized with a process 
matrix and suitable instruments, which shows that the 
set of process matrix correlations is indeed convex. It is 
straightforward to generalize the proof to a scenario with 
more parties. 

Note that our construction only shows that the con¬ 
vex hull of the set of correlations produced by process 
matrices of dimensions x dA 0 xdBrXdBo is contained 
in the set of correlations produced by process matrices of 
dimensions 2^, x dA 0 x 2x ds 0 , opening up the 
possibility that the set of process matrix correlations is 
not convex for any fixed input dimension (see Figure 2), 
analogously to the set of (nonsignaling) quantum correl¬ 
ations for fixed dimensions [42]. 
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